Distance Bounds for Convolutional Codes and Some Optimal 

Codes 



Heide Gluesing-Luerssen* and Wiland Schmale* 
February 1, 2008 



Abstract 

After a discussion of the Griesmer and Heller bound for the distance of a convolutional 
code we present several codes with various parameters, over various fields, and meeting 
the given distance bounds. Moreover, the Griesmer bound is used for deriving a lower 
bound for the field size of an MDS convolutional code and examples are presented showing 
that, in most cases, the lower bound is tight. Most of the examples in this paper are cyclic 
convolutional codes in a generalized sense as it has been introduced in the seventies. A 
brief introduction to this promising type of cyclicity is given at the end of the paper in 
order to make the examples more transparent. 
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1 Introduction 

The fundamental task of coding theory is the construction of good codes, that is, codes having 
a large distance and a fast decoding algorithm. This task applies equally well to block codes 
and convolutional codes. Yet, the state of the art is totally different for these two classes of 
codes. The mathematical theory of block codes is highly developed and has produced many 
sophisticated classes of codes, some of which, like BCH-codes, also come with an efficient 
decoding algorithm. On the other hand, the mathematical theory of convolutional codes is 
still in the beginnings. Engineers make use of these codes since decades, but all convolutional 
codes used in practice have been found by systematic computer search and their distances 
have been found by computer as well, see for instance |12j and Sec. 8] for codes having 
the largest distance among all codes with the same parameters. Moreover, in all practical 
situations decoding of convolutional codes is done by search algorithms, for instance the 
Viterbi algorithm or one of the sequential decoding algorithms, e. g. the stack algorithm. It 
depends on the algorithm how complex a code may be without exceeding the range of the 
decoding algorithms. However, the important fact about the theory of convolutional codes is 
that so far no specific codes are known that allow an algebraic decoding (in the present paper 
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a decoding algorithm will be called algebraic if it is capable to exploit the specific structure 
of the given code in order to avoid a full search). 

Since the seventies quite some effort has been made in order to find algebraic constructions of 
convolutional codes that guarantee a large (free) distance ^Ql El EH1 H] ■ The drawbacks 
of all these constructions are that, firstly, the field size has to be adapted and in general 
becomes quite large and, secondly, so far no algebraic decoding for these codes is known. A 
main feature of most of these constructions is that they make use of cyclic block codes in 
order to derive the desired convolutional code. 

Parallel to these considerations there was an independent investigation of convolutional codes 
that have a cyclic structure themselves, which also began in the seventies ^Hl E3 El 13] ■ It was 
the goal of these papers to see whether this additional structure has, just like for block codes, 
some benefit for the error-correcting capability of the code. The first and very important 
observation of the seventies was the fact that a convolutional code which is cyclic in the 
usual sense is a block code. This negative insight has led to a more complex notion of 
cyclicity for convolutional codes. The algebraic analysis of these codes has been completed 
only recently in jS] and yields a nice, yet nontrivial, generalization of the algebraic situation 
for cyclic block codes. Furthermore, by now plenty of optimal cyclic convolutional codes 
have been found in the sense that their (free) distance reaches the Griesmer bound. To the 
best of our knowledge it was, for most cases of the parameters, not known before whether 
such optimal codes existed. Many of these codes are over small fields (like the binary field) 
and are therefore well-suited for the existing decoding algorithms. Along with the algebraic 
theory of ..5_ all this indicates that this notion of cyclicity is not only the appropriate one for 
convolutional codes but also a very promising one. Yet, the theory of these codes is still in 
the beginnings. So far, no theoretical results concerning the distance of such a code or its 
decoding properties are known. But we are convinced that this class of codes deserves further 
investigation and that the theory developed so far will be a good basis for the next steps. 

It is the aim of this paper to present many of these examples in order to introduce the 
class of cyclic convolutional codes to the convolutional coding community. The examples 
are presented via a generator matrix so that no knowledge about cyclicity for convolutional 
codes is required from the reader. The (free) distances of all these codes have been obtained 
by a computer program. A detailed discussion of various distance bounds for convolutional 
codes over arbitrary fields shows that all the given codes are optimal with respect to their 
distance. It is beyond the scope of this paper to acquaint the reader with the theory of cyclic 
convolutional codes. However, in Section E] we will give a very brief introduction into this 
subject so that the reader may see how the examples have been constructed. The details of 
the theory can be found in p]. 

The outline of the paper is as follows. After reviewing the main notions of convolutional 
coding theory in the next section we will discuss in Section |3] various bounds for the free 
distance of a convolutional code, the Griesmer bound, the Heller bound and the generalized 
Singleton bound. The first two bounds are well-known for binary convolutional codes and 
can straightforwardly be generalized to codes over arbitrary fields. It is also shown that 
for all sets of parameters the Griesmer bound is at least as good as the Heller bound. The 
generalized Singleton bound is an upper bound for the free distance of a code of given length, 
dimension, and complexity, but over an arbitrary field. Just like for block codes a code 
reaching this bound is called an MDS code The Griesmer bound is used for showing 

how large the field size has to be in order to allow for an MDS code. In Section 0] many 
examples of codes are presented reaching the respective bound. Most of these examples are 
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cyclic convolutional codes, but we also include some other codes with the purpose to exhibit 
certain features of convolutional codes. For instance, we give examples of MDS codes showing 
that the lower bounds for the field size as derived in Section |3] are tight. Furthermore, an 
example is given showing that a code reaching the Griesmer bound may have extreme Forney 
indices, a phenomenon that does not occur for MDS codes. The paper concludes with a brief 
account of cyclicity for convolutional codes. 



2 Preliminaries 



We will make use of the following notation. The symbol F stands for any finite field while ¥ q 
always denotes a field with q elements. The ring of polynomials and the field of formal 
Laurent series over F are given by 



Ffzl 



N 

J2fr 



3=0 



oo 

N G No, fj G F} and F((z)) = { £ f jZ j 1 1 G Z, fj G f} 



The following definition of a convolutional code is standard. 

Definition 2.1 Let F = F g be a field with q elements. An (n,k, 5) q - convolutional code is a 
A;-dimensional subspace C of the vector space F((z)) n of the form 

C = imG:= {uG\ue¥((z)) k } 

where G G F[z] fcxn satisfies 

(a) G is right invertible, i. e. there exists some matrix G G F[z] nxfc such that GG = Ik- 
lb) 5 = max{deg7 | 7 is a /c-minor of G}. 

We call G a generator matrix and 5 the complexity of the code C. 

The complexity is also known as the overall constraint length p. 55] or the degree |16l 
Def. 3.5] of the code. Notice that a generator matrix is always polynomial and has a poly- 
nomial right inverse. This implies that in the situation of Definition 12.11 the polynomial 
codewords belong to polynomial messages, i. e. 

CD¥[z] n = {uG\ue¥[z] k }. (2.1) 

In other words, the generator matrix is delay-free and non-catastrophic. As a consequence, 
a convolutional code is always uniquely determined by its polynomial part. Precisely, if 
C = imG and C = imG' where G, G' G F[z] fcxn are right invertible, then 

C = C ^Cn¥[z] n = C'n¥[z] n . (2.2) 

This follows from (|2~H) and the fact that {uG \ u e ¥[z] k } = {uG' \ u £ ¥[z} k } is equivalent 
to G' = VG for some matrix V G F[z] fcxfc that is invertible over ¥[z]. This also shows that 
the complexity of a code does not depend on the choice of the generator matrix. From all 
this it should have become clear that with respect to code construction there is no difference 
whether one works in the context of infinite message and codeword sequences (Laurent series) 
or finite ones (polynomials) as long as one considers right invertible generator matrices. Only 
for decoding it becomes important whether or not one may assume the sent codeword to be 
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finite. The issue whether convolutional coding theory should be based on finite or infinite 
message sequences, has first been raised and discussed in detail in [2*T1 l2*Uj . 

It is well-known [2J Thm. 5] or p. 495] that each convolutional code has a minimal generator 
matrix in the sense of the next definition. In the same paper [21 Sec. 4] it has been shown 
how to derive such a matrix from a given generator matrix in a constructive way. 

Definition 2.2 (1) For v = J2jL v j zJ G ¥ \ Z T where v j G F " and V N / let degv := N be 
the degree of v. Moreover, put degO = — oo. 

(2) Let G G F[z] fcxn be a right invertible matrix with complexity 5 = max{deg7 | 7 is a 
/c-minor of G} and let v%, . . . , be the degrees of the rows of G in the sense of (1). We 
say that G is minimal if 5 = J2i=i u i- ^ n this case, the row degrees of G are uniquely 
determined by the code C := imG C F((,s)) n . They are called the Forney indices of C and 
the number max{z^i,... , Vf.} is said to be the memory of the code. An (n, k, 8) q -code 
with memory m is also called an (n, k, 6; m) q -code. 

From the above it follows that an (n, k, <5) 9 -convolutional code has a constant generator matrix 
if and only if 5 = 0. In that case the code can be regarded as an (n, fe)q-block code. 

The definition of the distance of a convolutional code is straightforward. For a constant 
vector w = (w\, . . . ,w n ) G F n we define its (Hamming) weight as wt(w) = ^{i \ Wi 7^ 0}. 
For a polynomial vector v = Y^f=o v j z '' ^ F[z] n , where Vj G F n , the weight is defined as 
wt(-u) = X]^=o w t( u j)- Then the (free) distance of a code C C F((z)) n with generator matrix 
G G F[z] fcxn is given as 

dist(C) := min {wt(u) I v eCnF[z] n , jj^O}. 

By virtue of (|2.1|) this can be rephrased as dist(C) = min{wt(uG) [ u G F[z] fc , u ^ 0}. 

When presenting some optimal codes in Section0]we will also investigate the column distances 
of the codes. For each / G No the lib. column distance of C is defined as 

d\ = min |wt((nG) [0i /]) u G ¥[z} k , u ± o} (2.3) 

where for a polynomial vector v = Ylf=o v j z '' we define U[ 0) /] = X^fco^'^ v j zj '■ ^ can easily 
be shown [2l Thm. 3.4] that for each code C there exists some M G No such that 

d c < di < d\ . . . < d c M = d c M+1 = ... = dist(C). (2.4) 



3 Distance Bounds 

In this section we want to present some upper bounds for the distance of a convolutional 
code. These bounds are quite standard for binary convolutional codes and can be found 
in Chapter 3.5 of the book 0. The proof for arbitrary fields goes along the same lines of 
arguments, but for sake of completeness we wish to repeat the arguments in this paper. We 
will also compare the numerical values of the bounds with each other. 

Let us begin with recalling various distance bounds for block codes. The Plotkin bound as 
given below can be found in 1.4.3], but can also easily be derived from the more familiar 
formula 

if d > 9n where 6 = ^— ^ then q k < , - ' , (3.1) 
q d — On 
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see for instance |131 (5.2.4)]. As for the Singleton and the Griesmer bound we also refer 
to [El Ch. 5.2]. 



Theorem 3.1 Let C C F n be an (n, k) q -block code and let d = dist(C). Then 

d < n — k + 1 (Singleton bound), 



d < 



k-l 



nq k 1 (q — 1) 



1 



< n 



(Plotkin bound), 
(Griesmer bound). 



1=0 



An (n, k) q -code C with dist(C) = n — k + 1 is called an MDS code. 

Notice that the Singleton bound does not take the field size into account. As a consequence 
the question arises as to how large the field size q has to be in order to allow the existence of 
MDS codes and how to construct such codes. Answers in this direction can be found in |14| 
Ch. 11]. 

It is certainly well-known that the Griesmer bound is at least as good as the Plotkin bound. 
The importance of the Plotkin bound, however, is that it also applies to nonlinear block 
codes, in which case it is usually given as in and with M := \C\ instead of q k . Since we 
did not find a comparison of the two bounds for linear block codes in the literature we wish 
to present a short proof of this statement. We also include the relation between the Griesmer 
and the Singleton bound. 

Proposition 3.2 Given the parameters n, k, d, and q £ N where k < n and q is a prime 
power. Assume ^z^q 1 ^ — n - Then 



(a) d< 



nq (q — 1) 



q k - 1 
(b) d < n - k + 1. 



There is no relation between the Plotkin and the Singleton bound in this generality. Roughly 
speaking, for relatively large values of q the Singleton bound is better than the Plotkin bound 
while for small values the Plotkin bound is better. 



Proof: (a) Assume to the contrary that d > |_— 



Since d is an integer this implies 



that d > 



^zll Thus 

d 



-1 



k-l 



1=0 



E 1 d n{q — 1) 

~J > 2^ q k _ i 

l=o H i=o H 



k-l-l 



n(q - 1] 



k-l 
\ ^ j 



1 ;=o 



n. 



□ 



(b) follows from ^f^ 1 [4] >d + k — 1. 

One should also recall that the Griesmer bound is not tight. An example is given by the 
parameters n = 13, k = 6, q = 2 in which case the Griesmer bound shows that the distance 
is upper bounded by 5. But it is known that no (13, 6)2-code with distance 5 exists, see 
p. 69]. 

We will now present the generalization of these bounds to convolutional codes. Let us begin 
with the Singleton bound. The following result has been proven in |22[ Thm. 2.2]. 
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Theorem 3.3 Let C C ¥((z)) n be an (n,k,S)-code. Then 
(a) The distance of C satisfies 



dist(C) <(»-*:)( - + l) + 5 + 1 =: 5(n, fc, 5). 



Tie number S(n, k, 5) is called the generalized Singleton bound for the parameters 
(n, k, S) and we call the code C an MDS code if dist(C) = S(n, k, 5). 

(b) If C is an MDS code and 5 = ak + r where a € No and < r < k — 1, then the Forney 
indices of C are given by 

a, . . . , a, a + 1, . . . , a + 1 . 

k—r times r times 

Hence the code is compact in the sense of j!6'l Cor. 4.3]. 

Just like for block codes the acronym MDS stands for maximum distance separable. In |22l 
Thm. 2.10] it has been shown that for all given parameters n, k, 5 and all primes p there exists 
an MDS code over a suitably large field of characteristic p. The proof is non-constructive 
and, as a consequence, does not give a hint about the field size required. In [23 Thm. 3.3] a 
construction of (n, k, <5)-MDS codes over fields ¥ p r is given under the condition that n\(p r — 1) 
and p r > ■ Notice that this requires n and the characteristic p being coprime. This 

result gives first information about the field size required in order to guarantee the existence 
of an MDS code. However, many examples of MDS codes over smaller fields are known. We 
will present some of them in the next section. Although they all have a certain structure in 
common (they are cyclic in the sense of Section |SJ) we do not know any general construction 
for cyclic MDS codes yet. 

Now we proceed with a generalization of the Plotkin and Griesmer bound to convolutional 
codes. 

Theorem 3.4 Let C be an (n,k,S;m) q -convolutional code having distance dist(C) = d. 
Moreover, let 

' N := {1,2,...}, if km = 5 
N := {0,1,2,...}, if km > 5 



N 



Then 



d < min 



n(m + i)qk{m+i)S-l ^ _ -g 



fc(m+i)— 5— 1 



: H q (n, k, 5; m) (Heller bound) 



d 



< max G {1, . . . ,S(n, k, <5)} | Yj \~] <n(m + i) for alii eN} 
=: G q (n, k, 5; m) (Griesmer bound) 



Moreover, G q (n,k,6;m) < H q (n,k,6;m). 

In the binary case (q = 2) both bounds can be found in 9, 3.17 and 3.22]. In that version the 
first bound has been proven first by Heller in [2] . The Griesmer bound as given above differs 
slightly from the one given at 3.22]. We have upper bounded the possible values for d! 
by the generalized Singleton bound, which is certainly reasonable to do. As a consequence, 
the Griesmer bound is always less than or equal to the generalized Singleton bound. This 
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would not have been the case had we taken the maximum over all d! £ N. This can be seen 
by taking the parameters (n, k, S;m) q = (5,2, 3;3)g. In this case the generalized Singleton 
bound is S(n, k, 5) = 10 but the inequalities of the Griesmer bound are all satisfied for the 
value d! = 12. 

The proof of the inequalities above is based on the same idea as in the binary case as we will 
show now. 

Proof: The last statement follows from Proposition I3.2f a). As for the bounds themselves 
we will see that they are based on certain block codes which appear as subsets of the given 
convolutional code C. This will make it possible to apply the block code bounds of Theo- 
rem ^. II The subcodes to be considered are simply the subsets of all codewords corresponding 
to polynomial messages with an upper bounded degree. 

Let C = imG, where G £ F[z] fcxra is right-invertible and minimal with Forney indices 
v%, . . . , V}.. Hence 5 = Yli=i v i an d m = maxj^i, . . . , v^}. Notice that km > 5 and km = 
5 <^=^> v\ = . . . = fk = m. For each i £ No define 

Ui = {(u\, . . . , £ ¥[z] k | degii/ < m + i — 1 — v\ for I = 1, . . . , k}. 

This implies u\ = if V[ = m and i = 0. In particular, U{ = {0} <J=^> km = 5 and i = 
and this shows that i = has to be excluded if km = 5. Obviously, the set Ui is an F-vector 
space and dimp U = ^z=i( m + * — v l) = k{m + i) — 5. Consider now d := {uG \ u £ U} for 
i £ No- Then d C C and Cj is an F-vector space and, by injectivity of G, 

diniF d = dimp Ui = k{m + i) — 8. 

Furthermore, minimality of the generator matrix G tells us that 

deg(uG) = max (deg u\ + v{) < m + % — 1 for all u £ Ui, 

1=1,. ..,k 

see p. 495]. Hence Cj can be regarded as a block code of length n(m + i) and dimension 
k(m + %) — 5 for all i £ N. Since dist(C) < dist(Cj) for alii € N we obtain the desired results 
by applying the Plotkin and Griesmer bounds of Theorem 13. II to the codes Cj. □ 

The proof shows that the existence of an (n, k, 5; m) q -code meeting the Griesmer bound 
implies the existence of (n(m + i), k(m + i) — <5) g -block codes having at least the same distance 
for all i £ N. The converse, however, is not true, since the block codes have to have some 
additional structure. We will come back to this at the end of this section. 

One should note that these bounds do only take the largest Forney index, the memory, into 
account. More precisely, the proof shows that codewords having degree smaller than m — 1 
are never taken into consideration. As a consequence, codes with a rather bad distribution 
of the Forney indices will never attain the bound. For instance, for a code with parameters 
(n, k, 5; m) q = (5, 3, 4; 2)2 the Griesmer bound shows that the distance is upper bounded by 6. 
This can certainly never be attained if the Forney indices of that code are given by 0, 2, 2 
since in that case a constant codeword exists. Hence the Forney indices have to be 1, 1,2. 
In this case a code with distance 6 does indeed exist, see the first code given in Table I of 
Sectional But also note that, on the other hand, a code reaching the Griesmer bound need 
not be compact (see Theorem I3.3f b)): an example is given by the (5, 2, 6; 4)2-code given in 
Table I of the next section. 

The Griesmer bound as given above has the disadvantage that infinitely many inequalities 
have to be considered. A simple way to reduce this to finitely many inequalities is obtained 
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by making use of the generalized Singleton bound S(n, k, 5). Instead of this bound one could 
equally well use any of the numbers occurring on the right hand side of the Heller bound. 

Proposition 3.5 Given the parameters n, k, m, S such that k < n and km > 5 and let q be 
any prime power. Define the set N as in Theorem \3.4l Furthermore, let iq G N be such that 
q k(m+i )-5 > ^ S j and put £^ := n n {0, 1, . . . , to}. Then 

k(m+i)— 8— 1 , 

G q (n, k, 6; m) = max [d> G {1, . . . , S(n, k,S)}\ £ \ d -] < n(m + i) for all i G N<, } . 

z=o q 

(3.2) 

Hence the distance of an (n, k, 5; m) q -code is upper bounded by the number given in ()3.2j) . 

We will see in the next section that the Griesmer bound is tight for many sets of parameters. 

Proof: Notice that for a > S(n,k,5) we have |~^] = 1 since d' < S(n,k,5). As for (|3~2^1 

it suffices to show that whenever dl satisfies the inequality Yl^^o^ 6 1 |"^r] ^ n(m + i) for 
some i > io, then it also satisfies the inequality for i + But this follows easily from 



fc(m+i+l)— 5-1 „ k(m+i)-8-l v fc(m+i+l)<5-l 



E 

1=0 



df 



E 



+ E 

l=k(m+i)— 8 



< n(m + i) + k < n(m + i + □ 



The finite sets for d' and i in (j3,2j) are not optimized, but they are good enough for our 
purposes since they allow for a computation of the Griesmer bound in finitely many steps. 
Unfortunately 1)3.2(1 does not reveal the block code case where only the index i = 1 has to be 
considered according to Theorem 13. 11 The consistency of the Griesmer bound for m = 5 = 
with that case is guaranteed by the following result. 



Proposition 3.6 Given the parameters n, k, and q. Then 



ki-l 



max |d' G N | ^ |~^- < ni for all i G n| = max |d' G N | ^ |~^y < raj. 



fc-i 



df 



1=0 



Proof: Let d! be any number satisfying Y^=o — n - ^ e nave to show that df satisfies 
the inequalities given on the left hand side for all % G N. In order to do so, notice that 
according to Proposition 13.21 a) 



d'< 



nq 



k-l 



(9-1) 



< 



nq 



k-l 



1 + 0+...+ 



-.k-l 



< 



n 



k-l 



But this implies 4 < | for all / > k, thus < f and 



ki-l 



E[5l-E[51 + E[ 



fc-i 



d!_- 



ki-l 



df 



i=k 



n 



< n + k(i — 1)— = ni. 

k 



This proves the assertion. 



□ 



Finally we want to investigate as to how big the field size q has to be in order to allow for an 
MDS code with parameters (ra, k, S) q . A first estimate can be achieved by using the Griesmer 
bound in combination with the generalized Singleton bound. 
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Theorem 3.7 Let C C ¥((z)) n be an (n, fc, 5; m) q -MDS code, thus d := dist(C) = 5(n, fe, 5) 
(n - fc)([|J + l) + -5 + 1. Then tie field size o satisfies 



> 



n _ k+l , if [k = 1} or [k > 1 and km = 5 + 1] 
d, if [k > 1 and /cm ^ 5 + 1] . 



The estimate above also covers the block code case as given in |14| p. 321]. 

Proof: We will consider the various cases separately. In each case we will apply the inequal- 
ity 



d 



k(m+i)— 6— 1 



d 



— < n(m + i) — d — — , (3-3) 

q 1=2 q 

which is a simple consequence of the Griesmer bound, to the case d = S(n,k,5). Moreover 
we will make use of the fact that |~4] > 1 for all I € N. 

k = 1: In this case m = 5 and d = n(m + 1). Since k(m + i) — 5 — 1 = i — 1 Inequality ()3,3|) 
gives us 

— < n(m + i) — n(m + 1) — (i — 2) = n(i — 1) — i + 2 

q 

for all i > 2. This shows g > ^ as desired. Using i = 1 in the Griesmer bound simply leads 
to c£ < n(m + 1). This is true by assumption and gives no further condition on q. 
k > 1 and km = 5: Now m = f and thus d = (n—k)(m+l)+mk+l. Using k(m+i)—5—l = 
ki — 1 we obtain from Inequality (|3.3|) 

— < n(m + i) — (n — k)(m + 1) — mk — 1 — (ki — 2) = (n — k)(i — 1) + 1 

for all i > 1. Using i = 1 leads to q > d. 

k > 1 and fcm > (5: In this case m = |_^J + 1, see Theorem l3.3f b). and d = (n — k)m + 5+1. 
Therefore Inequality ()3.3|) leads to 

— < n[m + i) — (n — k)m — 5 — 1 — {k(m + i) — 5 — 2) = (n — k)i + 1 

for all i > 1. This shows q > • In order to finish the proof we have to consider also 

i = 0. In the case km = 5 + 1 the Griesmer bound applied to i = simply leads to d < nm, 
which is true anyway, and no additional condition on q arises. If km — 5 > 1 a better bound 
can be achieved. Since I |J = m — 1, we obtain after division with remainder of 5 by k an 
identity of the form 5 = (m — l)k + r where < r < k — 1. Thus d = nm — k + r + 1 and 
Inequality (|3.3|) for i = leads to 

d . k ^\d 



— < nm — d 



E [J <fc-r-l-(fc 



hence q > d. 

This covers all cases, since we always have km > 5. □ 

The proof shows that in general the lower bounds on q are not tight since we have estimated 
r^"J by 1 for / > 2 in all cases. For instance, if (n — k + l) 2 > d, no (n, k, 5; m) g -MDS code 

exists for q = and k = 1 or km = 5 + 1. But even if |~-4] = 1 f° r all Z > 2 there might 
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not exist an (n, k, <5)„-MDS code where q attains the lower bound. The obstacle is that for 
some i £ N there might not exist an (n(m + i), k{m + i) — 5) g -block code with the appropriate 
distance as required by the proof of Theorem 13 .41 Since these block codes have to produce a 
convolutional code in a very specific way, they even have to have some additional structure. 
We wish to illustrate this by the following example. 

Example 3.8 Let (n, k, 5) = (3, 2, 3). The generalized Singleton bound is d := 5(3, 2, 3) = 6 
and the memory of a (3, 2, 3)-MDS code is m = 2, see Theorem l3.3f b). From Theorem 13 . 71 we 
obtain q > 3 for the field size. Taking g = 3we have |~-^] = 1 so that indeed the lower bound 
for the field size cannot be improved. The existence of a (3, 2, 3; 2)3-MDS code requires the 
existence of (3(2 + i), 1 + 2i)3-block codes with distance at least 6 for all i £ No- Such codes 
do indeed exist 1 . However, the block codes have to have some additional structure in order to 
be part of a convolutional code. To see this, let G € F3[z] 2x3 be a minimal generator matrix 
of the desired convolutional code C. Write 



G 



+ z 



where gi € F§. 



Recall from the proof of Theorem 13 . 41 that our arguments are based in particular on the block 

Comparing like powers of z one observes that 



code C\ := {(ui,u 2 + u 3 z)G \ ui,U2,u 3 G F 3 }. 
this code is isomorphic to 

91 93 
Ci = im g 2 54 
.0 g 2 



95 



94 



CF§\ 



Using elementary row operations on the polynomial matrix G we may assume that the entry 
of G at the position (1, 1) is a constant. Furthermore, after rescaling the columns of G we 
may assume 54 = (1, 1, 1). Finally, due to non-catastrophicity, the entries of g 2 are not all the 
same and because of dist(Ci) = 6, all nonzero. This gives us (up to block code equivalence) 
the two options 



1111 



a\ a 2 03 04 as a6 aj 
112 1110 
1 1 2 1 1 1 



or 



1111 



fli 02 ^3 04 as 06 Ol-j 
1 2 2 1 1 1 
1 2 2 1 1 1 



for C\. Going through some tedious calculations one can show that no such code in F3 with 
distance 6 exists. Hence no (3, 2, 3)3-MDS convolutional code exists. 



In the next section we will give examples of MDS codes over fields ¥ q where q attains the 
lower bound in all cases except for the case km = 5 + 1. 



4 Examples of Some Optimal Convolutional Codes 

In this section we present some convolutional codes with distance reaching the Griesmer 
bound. To the best of our knowledge it was for most of the parameters, if not all, not known 
before whether such codes existed. 

1 For small i these codes can be found in tables listing ternary codes. For the general case we wish to thank 
H.-G. Quebbemann who pointed out to us a construction of such codes for sufficiently large i using direct 
products of finitely many "short" MDS-codes over F33 and mapping them into ternary codes. 
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In the first column of the tables below the parameters of the given code are listed. In the 
second column we give the Griesmer bound g := G q (n,k,S;m) for these parameters. The 
third column gives a code reaching this bound. In all examples the distance of the code 
has been computed via a program. In each case the code is given by a minimal generator 
matrix. Thus, in particular all matrices given below are right invertible. In the forth column 
we present the index of the first column distance that reaches the free distance, cf. (|2.4[) . 
In the last column we indicate whether the code is a cyclic convolutional code in the sense 
of Section At the moment this additional structure is not important. We only want to 
mention that cyclic convolutional codes do not exist for all sets of parameters, in particular 
the length and the characteristic of the field have to be coprime (just like for block codes). 
Moreover, the shortest binary cyclic convolutional codes with complexity 5 > have length 
n = 7 or n = 15. 

The fields being used in the tables are F2 = {0, 1}, F4 = {0, 1, a, a 2 } where a 2 + a + 1 = 0, 
F 8 = {0, 1, /?,... ,/3 6 } where (3 3 + /3 + 1 = 0, and F 16 = {0, 1, 7, . . . , 7 14 } where 7 4 + 7 + l = 0. 

The generator matrix G3 of the (15,4, 12;3)2-code in Table I is given by 



Gs T = 



l + z 2 


l + z + z 3 


z + z 2 


l + z + z 3 


l + z + z 2 


1 + z + z 2 + z 3 


1 + z + z 2 + z 3 


z 


1 + z + z 3 


l + z + z 2 


l + z + z 2 


1 + z 2 + z 3 


z 


l + z + z 3 


1 


l + z + z 2 


z 


z 2 


l + z 


l + z 3 


z 2 


z + z 3 


z 3 


1 + z + z 2 + z 


l + z + z 3 


z 2 + z 3 


z + z 2 + z 3 


z 


z 3 


l + z + z 2 


z + z 3 


z 2 


z + z 2 + z 3 


z + z 2 


l + z 3 


z 2 + z 3 


1 + z + z 2 + z 3 


z 2 + z 3 


z 2 


l + z + z 2 


1 


1 


z + z 2 + z 3 


z 2 


z 2 + z 3 


l + z 


1 





l + z 





1 + z 2 + z 3 


l + z 3 


z 2 + z 3 


l + z 2 + z 3 


z 3 


l + z + z 3 


1 + z 2 + z 3 


z 3 


l + z + z 2 


z + z 2 + z 3 



Some additional explanations and remarks will follow the tables. 
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(n, k,5;m) q 


9 


code meeting the Griesmer bound 




cy 


(5,3,4; 2) a 


6 




1 + z 2 1 + z z 1 + z 2 z + z 2 
1+z z 1+z 1 z 
z 1 1+z 1+z 1 


(not even) 


7 




(5, 2, 6; 3) 2 


12 




V s + z 2 + 1 z 2 + z z 3 + z + 1 z 2 + z z 3 + l" 
z + 1 z 3 + z 2 + 1 z 3 + z 2 z 3 + z + 1 z 2 + z 


(even) 


10 




(5, 2, 6; 4)2 


12 




"1 + z 3 + z 4 1 + z + z 4 1 + z 3 1 + z 2 + z 3 z + z 3 + z 4 
1 + z 2 1+z z 2 + z z 2 + z + l z 2 + z + l 


(even) 


10 




(9,3, 1;1) 8 


8** 




"z + 1 z + P z z + p 2 z + p s z + P g z + 1 z z + P~ 
1 /3 2 /3 5 /3 6 p 6 p 5 p 2 1 
1 /3 2 /3 5 /3 6 p 6 P 5 p 2 1 




1 




(3, 2, 2; 1) 8 


5** 


[2 + 3z 3z 4 + 4z" 
[4 + 2z 1 + 3z 2z 




5 




(7, 3, 3; 1)2 


8 


Gi = 


"1 z 1 + zl + z 1 z 0" 
zl + z 1 + z 1 1 z 
z 1 1+zl+zl+z 


(even) 


2 


X 


(7,3,6; 2)2 


12 


G 2 = 


"1 + z 2 z + z 2 1 + z 1 + z 1 + z 2 z z 2 
z 1 + z + z 2 1 + z + z 2 1 + z 2 1 + z 2 z 
z 2 z + z 2 1 + z 2 1 + z 1 + z + z 2 1 + z 


(even) 


5 


X 


(7, 3, 9; 3)2 


16 




"1 + z 2 + z 3 z + z 2 1 + z + z 3 1 + z 1 + z 2 z + z 3 z 2 + z 3 " 
z 1 + z + z 2 + z 3 1 + z + z 2 1 + z 2 + z 3 1 + z 2 + z 3 z + z 3 
z 2 + z 3 z + z 2 1 + z 2 z 3 1 + z + z 3 1 + z + z 2 + z 3 1 + z _ 


(even?) 


9 


X 


(7,3, 12;4) 2 


20 




"1 + z + z 3 + z 4 1 + z 3 + z 4 1 + z 2 z + z 2 + z 4 1 + z 2 + z 3 z z + z 2 + z 3 + z 4 " 
z 2 + z 3 1 + z + z 2 + z 4 1 + z 4 1 + z + z 2 + z 3 + z 4 z 1 + z + z 3 + z 4 z 2 + z 3 
z 2 + z 4 z 1 + z + z 3 1 + z + z 2 + z 4 1 + z 2 + z 3 + z 4 z 2 + z 3 + z 4 1 + z + z 3 


(doubly even?) 


14 


X 


(15,4,4; 1) 2 


lb 


Gi = 


z z 1+z 1+z 1 1 z 1+z 1+z 1+z 1 
1 z 1 z 1+z 1+z z 1 z 1 1+z 1+z 
1 1 z z z 1 + z z 1 1 + z z 1 1 + z 1 
1+z 1+z 1 z z 1+z 1+z 1 1 z 1+z 


(even) 



z 


X 


(15, 4, 8; 2)2 


24 


G 2 = 


"l + z 2 l + z + z 2 1 + z z z z 2 1 + z z + z 2 l + z + z 2 1 z 2 1 + z z 2 1 + z 2 " 
l + zl + z + z 2 l+z + z 2 1 + z z 2 z z 2 l+z + z 2 z + z 2 z 2 1 1 + z 1 + z 2 
z + z 2 l + z + z 2 l+z + z 2 1 1 + z z + z 2 z 1 z 2 z + z 2 1 1 + z 2 1 + z + z 2 
_l + z z 1 + z 2 1 + z + z 2 1 1 + z + z 2 z z 2 z 2 1 + z + z 2 z 2 1 1 + z z + z 2 

(even?) 


5 


X 


(15, 4, 12; 3)2 


32 


see G3 above, (even?) 




X 





a 
if 


code meeting the Griesmer bound 




cv 
L ,> 


(3, 1,1; 1) 4 


6* 


[a + az, a 2 + az, 1 + az\ 


2** 


X 


(3, 1,2; 2)4 


9* 


[a + az + z 2 , a 2 + az + a 2 z 2 , 1 + az + az 2 ] 


5 


X 


(3, 1,3; 3)4 


12** 


[a + az + z 2 + a 2 z 3 , a 2 + az + a 2 z 2 + z 3 , 1 + az + az 2 + az 3 ] 


7 


X 


(3, 1,4; 4)4 


14 


r 1 ,2,23, 4 2, 22,3, 4 i , 2, 3, 4i 

[a + az + z + a z + az , a + az + a z + z + az , 1 + az + az + az + az \ 


10 


X 


(3, 1,5; 5)4 


16 


[a + az + z 2 + a 2 z 3 + az 4 + az 5 , a 2 + az + a 2 z 2 + z 3 + az 4 + /' , 1 + az + az 2 + az 3 + az 4 + a 2 z h ] 


11 


X 


(5, 2, 2; 1)4 


8 




a + 2 a 2 + a 2 z a 2 +a 2 z a + z 
a + a 2 z z a a 2 + z a 2 + a 2 z 




2 


X 


(5, 2, 4; 2)4 


12 


a + z + az 2 a 2 + a 2 z + a 2 z 2 a 2 + a 2 z + a 2 z 2 a + z + az 2 
a + a 2 z + az 2 z + a 2 z 2 a + a 2 z 2 a 2 + z + az 2 a 2 + a 2 z 


5 


X 


(5, 2, 6; 3)4 


16 


a 2 + a 2 z + az 2 + z 3 1 + az + a 2 z 2 + a 2 z 3 1 + az + a 2 z 2 + a 2 z 3 a 2 + a 2 z + az 2 + z 3 ~\ 
[a 2 + az + az 2 + a 2 z 3 a 2 z + a 2 z 2 + a 2 z 3 a 2 + a 2 z 2 + z 3 1 + a 2 z + az 2 1 + az + z 3 


9 


X 


(O, Z, Z, L)ie 


c* 



\r'+j 4 z 7 3 +7 8 z 7 9 +7 2 z] 
|_7 9 + 7 12 z 7 5 + 7 14 2 7 3 + 7 3 zJ 


Q** 
O 




(3,2,3; 2)i6 


6* 




'j + jz + z 2 7 6 +72 + 7 1() 2 2 7 11 +72 + 7 5 2 2 " 
1 + 2 7 10 +7 5 2 7 5 +7 10 2 




5 


X 


(5,l,l;l)i 6 


10* 


[7 + 72, 7 13 + 7 10 2 , 7 10 + 7 4 2, 7 7 + 7 13 2, 7 4 + 7 1 ' z\ 


2** 


X 


(5,l,2;2)i 6 


15* 


[7 + ^z + 7* 2 , 7 7 + 7^ + 7 10 ^ 2 , 7 13 + 7 13 * + 7V, 7 4 + 7 10 * + 7* V, 7 10 + 7 ? * + 7V] 


3 " 


X 


(5, 1, 3; 3)i6 


20* 


[7 + z + 7 V + z 3 , 7 7 + 7 12 2 + 7 1 V + 7 V, 7 13 + 7 9 2 + r'z 2 + 7 V, 7 4 + fz + 7 14 2 2 + 7 V, 7 10 + y 3 z + 7 8 2 2 + 7 12 2 3 ] 


5 


X 


(5,2,2;l)i 6 


9* 


[7 + 72: 7 13 +7 10 2 7 10 +7 4 2 7 7 + 7 13 2 7 4 +7 7 2 l 
[l + 7 5 2 7 3 + 7 n 2 7 6 + 7 2 2 7 9 +7 8 2 7 12 +7 14 2j 


2** 


X 


(7, 1,1; l)s 


14* 


[p + pz, (3 3 + z, p 5 + p 6 z, 1 + (3 s z, p 2 + (3 4 z, (3 4 + p 3 z, p 6 + p 2 z] 




X 


(7, 1,2; 2) 8 


21* 


[I3 2 + f3z + z 2 ,f3 5 + p 3 z + p fi z 2 ,p + p 5 z + p 5 z 2 ,p 4 + 2 + p 4 z 2 , l+p 2 z + p 3 z 2 ,p 3 + f3 4 z + p 2 z 2 ,p ii + p 6 z + f3z 2 ] 




X 


(7, 1,3; 3) 8 


28* 


[l + f3z+f3 6 z 2 + z 3 , l+p 5 z + p 5 z 2 +p 5 z 3 , l+p 2 z+p 4 z 2 + p 3 z 3 , l+p ti z + p 3 z 2 +Pz 3 , l+0 3 z + /3 2 z 2 + p e 'z 3 , l + z+j3z 2 + f3 4 z 3 , 1+/3 4 z + z 2 +/3 2 z 3 ] 


5 


X 


(7,2,3; 2) 8 


14* 


\l + z + p 4 z 2 p 4 +p 5 z + p 5 z 2 /3 + P 3 z + (3 6 z 2 p 5 +Pz + z 2 p 2 + (3 6 z + Pz 2 p 6 +p 4 z + p 2 z 2 p 3 + p 2 z + p 3 z 2 } 
P + Pz p 3 + z P 5 +P 6 z 1 + P 5 z P 2 +P 4 z P 4 +P 3 z P 6 + P 2 z 


3 


X 



Table III 



(n, k, o; m) q 


.9 


code meeting the Griesmer bound 




cy 


(Pi 3 3- 1 )o 


6 


rnlnniTiQ 1 9 S R fi 7 of ^1 ipvpti 1 


3 




ffi 3 f>- 2li 


10 


mliimns 1 2 4 5 6 7 nf On (ewn\ 

UU1 LXXXXXXo J. . _ . 1 » • } -. v ' . I k_JX v.X ^ I C V CXX / 


3 




C14 4 4- 1 \n 


14 


/->/~il 1 1 tti n a 1 1/1 nf 1 -J 1 in o^roTi i 

CUllxIllIlO _L _L4: Ul IjI lllUL CVtiXX I 


Q 
O 




fl 3 4 4- 1 1o 


13 


poln m n 1 9 4 — 14 nf (tint pvptii 

t_.WXl_XXXXXXo X , _ . X X_ t: vJ X V ' J_ 1 11WI V 1_jXX 1 


3 




(^2 4 4- ilo 


12 


rnlnmrm 1 9 4 — 19 14 of f7i (pvpti^ 


3 




I'm 4 4- iio 


10 


pnliimrm 1 9 4 fi — 1 1 14 of ( pvpti^ 


4 




('8 4 4-1 "li 

^O, 4:, 4:, -L;2 


g 


columns 1 2 4 5 8 11 13 14 of (7i (not pvpn) 

*_.WXl_XXXXXXO A. , £l j , ^ , O , X_ X_ , X_ t: Ul v_. * J_ 1 vj V Ci X I 


4 




fl 4 4 8- 91 1 


22 


pnlninnc 9 1 R /~it (Z<~, ( oi/on / 1 

LUxlxixxixa Zi _LiJ (Jl (^r2 I CvCll : ( 


u 




(13 4 8- 2)9 


20 


columns 1 — 4, 7 — 15 of G2 (even?) 


6 




(12, 4, 8; 2)2 


18 


columns 1, 2, 4, 7 - 15 of G2 (not even) 


6 




(10, 4, 8; 2)2 


16 


columns 1, 2, 4, 5, 7, 8, 10, 11, 13, 14 of G2 (even?) 


7 




(8, 4, 8; 2)2 


12 


columns 1,2,6,9, 12 - 15 of G 2 (even?) 


9 





It remains to explain some additional notation of the tables. We also make some further 
comments illustrating the contents of the tables. 

Remark 4.1 (a) A * attached to the bounds in the second column indicate that these 
numbers are identical to the generalized Singleton bound. Hence the corresponding 
codes are even MDS codes. 

(b) An additional supscript • attached to the bound g indicates that the code is an MDS 
code where the field size reaches the lower bound of Theorem 13 .71 This gives us examples 
for the three cases k = 1, km > 5 + 1, and km = 5. We did not find an example of an 
(n, k, 5) q -MT)S code where km = 5 + 1 and q = n _ d k+1 ■ 

(c) In [3J Prop. 2.3] it has been shown that the jth column distance of an (n, k, <5)„-code 
satisfies dj<(n — k){j + 1) + 1. From this it follows that the earliest column distance 
of an MDS code that can reach the free distance has index M := [f J + |"^r7j] > see IS 
Prop. 2.6]. In the same paper an MDS code is called strongly MDS if the Mth column 
distance is equal to the free distance. We attached a ** to the index of the column 
distance in the second last column of the tables in order to indicate the strongly MDS 
codes. As far as we know no upper bound for the column distances is known that also 
takes the field sizes into account. However, using the estimate dj<(n — k){j + 1) + 1 
one observes that the (5, 2, 2; 1)4- and the (9, 3, 1; l)s-code are also optimal in the sense 
that no code with the same parameters exists where an earlier column distance reaches 
the free distance. We did not investigate whether any of the other codes is optimal in 
this sense. 

(d) We investigated the binary codes with respect of being even, that is, whether all code- 
words have even weight. This can be done by computing the weight distribution (see |17j 
or [HI Sec. 3.10]). Evenness of a code is indicated by an (even) attached to the generator 
matrix. Since the computation of the full weight distribution is very complex for larger 
complexity, we did not fully check the binary codes having complexity bigger than 6. In 



14 



those cases we checked the weight of codewords associated with message words of small 
degree. In case this weight is always even we think there is strong evidence that the 
code is even and attached an (even?) to the generator matrix. In this sense there is also 
evidence that the (7, 3, 12; 4)2-code is doubly even, that is, all codewords have weight 
divisible by 4. Further investigation is necessary in order to understand whether (and 
why) all the binary cyclic convolutional codes of length 7 and 15 are even. 

(c) The second and third code of Table I show that a code meeting the Griesmer bound 
need not have evenly distributed Forney indices. In other words, such a code need not 
be compact in the sense of Theorem I3.3lf b) . For both codes in Table I the free distance is 
attained by the 10th column distance. Only the full weight distribution shows that the 
code with Forney indices 3, 3 is better than the code with indices 4, 2. The first one has 
weight distribution 

W\(T) = 10T 12 + 12T 14 + 71T 16 + 248T 18 + 873T 20 + . . . , 

saying that there are 10 molecular codewords of weight 12 and 12 molecular codewords of 
weight 14, etc. (for the definition of molecular codewords, see ^7j; for weight distributions 
see also [9, Sec. 3.10]). The weight distribution of the second code is 

W 2 (T) = 10T 12 + 27T 14 + 99T 16 + 350T 18 + 1280T 20 + . . . . 

(f) It is worth being mentioned that the codes with parameters (7, 3, 3; 1)2, (7, 3, 6; 2)2, and 
(7, 3, 9; 3)2 form a sequence in the sense that if one deletes z 3 (resp. z 2 ) in the last 
(resp. second) of the according generator matrices then one obtains the previous code. 
The same applies to the codes with parameters (3, 1, 1; 1)4, . . . , (3, 1, 5; 5)4 as well as to 
the (5, 2, 2; 1)4- and (5, 2, 4; 2) 4 -codes. 

(g) The codes with parameters (7, 3, 3; 1)2, (7, 3, 6; 2)2, (15, 4, 4; 1)2 and (15,4, 8; 2)2 are ex- 
tremely robust against puncturing in the sense of cutting columns of the according gen- 
erator matrix (this is not puncturing in the sense of fl(il Sec. 8]). This way we do not 
only obtain right invertible matrices again, but even minimal matrices and, by doing 
this appropriately, codes reaching the Griesmer bound. We have cut one column of the 
codes of length 7 and up to 7 columns of the codes of length 15. The results are given 
in Table III. The only cases where we did not get codes reaching the Griesmer bound 
are for (11,4, 4; 1)2 and for (9,4, 8; 2)2. We do not know if for these parameters there 
exist any codes at all that reach the bound. Since G2(9,4, 4; 1) = 8 = G2(8,4, 4; 1) and 
^2(11, 4, 8; 2) = 16 = £2(10, 4, 8; 2) we skipped in both cases the bigger length. Punc- 
turing the code of length 7 and memory bigger than 2 did not result in a code meeting 
the Griesmer bound. We did not puncture the code of length 15 and memory 3. 

(h) Consider the (8, 4, 4; l)2-code given in Table III. There are other codes with exactly these 
parameters given in the literature. Indeed, in |Hj some (doubly-even self-dual) (8, 4, 4; 1)2- 
codes are presented. Our code is not even, which can easily be seen by writing down the 
generator matrix. We also computed the weight distribution and obtained 

W(T) =11T 8 + 28T 9 + 39T 10 + 101T 11 + 206T 12 + 565T 13 + 1374T 14 + 3033T 15 
+ 7366T 16 + 16984T 17 + 40510T 18 + 95617T 19 + 22348T 20 + . . . , 

which is better than the weight distribution of the self-dual code given in [SJ Eq. (10)]. 
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5 Cyclic Convolutional Codes 



The first two tables of the last section list plenty of optimal codes that we have declared as 
cyclic. Moreover, they gave rise to further sets of optimal codes as listed in Table III. In this 
section we want to briefly describe the notion of cyclicity for convolutional codes. The first 
investigations in this direction have been made in the seventies by Piret ^S] and Roos ^U] • In 
both papers it has been shown (with different methods and in different contexts) that cyclicity 
of convolutional codes must not be understood in the usual sense, i. e. invariance under the 
cyclic shift, if one wants to go beyond the theory of cyclic block codes (see Theorem l5,2l below) . 
As a consequence, Piret suggested a more complex notion of cyclicity which then has been 
further generalized by Roos. In both papers some nontrivial examples of cyclic convolutional 
codes in this new sense are presented along with their distances. All this indicates that the new 
notion of cyclicity seems to be the appropriate one in the convolutional case. Unfortunately, 
the papers |18[ I19j did not get much attention at that time and the topic came to a halt. 
Only recently it has been resumed in Therein, an algebraic theory of cyclic convolutional 
codes has been established which goes well beyond the results of the seventies. On the one 
hand it leads to a nice, yet nontrivial, generalization of the theory of cyclic block codes, on 
the other hand it gives a very powerful toolbox for constructing cyclic convolutional codes. 
We will now give a very brief description of these results and refer to [2| for the details. 

Just like for cyclic block codes we assume from now on that the length n and the field size q 
are coprime. Let F = ¥ q be a field of size q. Recall that a block code C C F n is called cyclic 
if it is invariant under the cyclic shift, i. e. 

(vq, . . .,V n -i) e C => (l>„-l,Uo, • • -,Vn-2) e C (5.1) 

for all (vq, . . . , v n -i) € F n . It is well-known that this is the case if and only if C is an ideal in 
the quotient ring 

n-1 

A := F[x]/^n _ x) = { W m od (x n - 1) 

i=0 

identified with F n in the canonical way via 

n-1 

p:F n — >A, (vo,...,v n -i) i — >^Vix\ 

i=0 

At this point it is important to recall that the cyclic shift in F n translates into multiplication 
by x in A, i. e. 

p(v n -l,V , . . .,V n - 2 ) = Xp(v , ■ ■ ■ ,Un-l) (5.3) 

for all (vq, . . . ,v n -i) £ F™. Furthermore, it is well-known that each ideal / C A is principal, 
hence there exists some g £ A such that I = (g). One can even choose g as a monic divisor 
of x n — 1, in which case it is usually called the generator polynomial of the code p _1 (7) CF". 

It is our aim to extend this structure to the convolutional setting. The most convenient way 
to do so is by using only the polynomial part C n ¥[z] n of the convolutional code C C ¥((z)) n . 
Recall from ()2.2|) that this uniquely determines the full code. Hence imposing some additional 
structure on the polynomial part (that is, on the generator matrix) will also impose some 
additional structure on the full code. In Remark l5.6l below we will see from hindsight that one 
can just as well proceed directly with the full code. The polynomial part of a convolutional 



/ ,...,/n-l GF}, (5.2) 
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code is always a submodule of the free module F[z] n . Due to the right invertibility of the 
generator matrix not every submodule of ¥[z] n arises as polynomial part of a convolutional 
code. It is easy to see Prop. 2.2] that we have 

Remark 5.1 A submodule S C ¥[z] n is the polynomial part of some convolutional code if 
and only if S is a direct summand of F[z] n , i.e. S(BS' = ¥[z] n for some submodule S' C F[z] n . 

In order to extend the situation of cyclic block codes to the convolutional setting, we have 
to replace the vector space F n by the free module ¥[z] n and, consequently, the ring A by the 
polynomial ring 

N 

A[z] := | ^ z j aj N G N , aj G a} 
j=0 

over A. Then we can extend the map p above coefficientwise to polynomials, thus 

N N 

p:F[*f— »'r ; - >»pir ; ). (5.4) 

3=0 3=0 

where, of course, Vj G F n and thus p(i)j) G A for all j. This map is an isomorphism of 
F[z]-modules. Again, by construction the cyclic shift in ¥[z] n corresponds to multiplication 
by x in A[z], that is, we have (|5.3|) for all (vq, • • • ,fn-i) £ F[z] ra . At this point it is quite 
natural to call a convolutional code C C ¥((z)) n cyclic if it is invariant under the cyclic shift, 
i. e. if (|5.1|) holds true for all (vq, ■ ■ ■ ,v n -l) G ¥((z)) n . This, however, does not result in any 
codes other than block codes due to the following result, see p2 Thm. 3.12] and ^3 Thm. 6]. 
An elementary proof can be found at [SJ Prop. 2.7]. 

Theorem 5.2 Let C C F((z)) n be an (n,k, 8) -convolutional code such that 1)5. lj) holds true 
for all (vq : ..., v n -\) £ F[z]". Then 5 = 0, hence C is a block code. 

This result has led Piret ^H] to suggest a different notion of cyclicity for convolutional codes. 
We will present this notion in the slightly more general version as it has been introduced by 
Roos US]. 

In order to do so notice that F can be regarded as a subfield of the ring A in a natural way. 
As a consequence, A is an F-algebra, i. e., a ring and a vector space over the field F and the 
two structures are compatible. In the sequel the automorphisms of A with respect to this 
algebra structure will play an important role. Therefore we define 

Autp(^4) := {cr : A — > A | a\r = idg, a is bijective, a{a+b) = a(a)+a(b) for all a, b G vl}. 

It is clear that each automorphism a G Autp(^4) is uniquely determined by the single value 
cr(x) G A. But not every choice for cr(x) determines an automorphism on A. Since x generates 
the F-algebra A, the same has to be true for u{x) and, more precisely, we obtain for a G A 

cr(x) = a determines an 1 ( 1, a,..., a n_1 are linearly independent over F 



automorphism on A j " ' 1 and a n = 1. (^-^) 

Of course, c(x) = x determines the identity map on A. It should be mentioned that there is 
a better way to determine the automorphism group of A by using the fact that the ring is 
direct product of fields. This is explained in [SJ Sec. 3]. 

The main idea of Piret was to impose a new ring structure on A\z\ and to call a code cyclic 
if it is a left ideal with respect to that ring structure. The new structure is non-commutative 
and based on an (arbitrarily chosen) automorphism on A. In detail, this looks as follows. 
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Definition 5.3 Let a G Aut F (A). 

(1) On the set A[z] we define addition as usual and multiplication via 

N M N+M 

y~] z j a,j -^2z i bi= £ ^2 ^(^o^ for a11 N > M G N ° and a ^ ^ e A - 

3=0 1=0 t=0 j+l=t 

This turns A\z\ into a non-commutative ring which is denoted by ^4[z;cr]. 

(2) Consider the map p : ¥[z] n — * A[z; a] as in (|5.4|1 . where now the images p(v) = ^2jLo z ^p{ v j) 
are regarded as elements of A[z; a]. A direct summand <S C ¥[z] n is said to be a-cyclic if 
p(«S) is a left ideal in A[z; a]. 

(3) A convolutional code C C ¥([z)) n is said to be a-cyclic if C H F[z] n is a cr-cyclic direct 
summand. 

A few comments are in order. First of all, notice that multiplication is determined by the 
rule 

az = za(a) for all a S A (5-6) 

along with the rules of a (non-commutative) ring. Hence, unless a is the identity, the inde- 
terminate z does not commute with its coefficients. Consequently, it becomes important to 
distinguish between left and right coefficients of z. Of course, the coefficients can be moved 
to either side by applying the rule (|5.6|) since a is invertible. Multiplication inside A remains 
the same as before. Hence A is a commutative subring of A[z; a]. Moreover, since cj|f = idp, 
the classical polynomial ring ¥[z] is a commutative subring of A[z; a], too. As a consequence, 
A[z; a] is a left and right F[z]-module and the map p : ¥[z] n — > A[z; a] is an isomorphism of 
left F[z]-modules (but not of right F[z]-modules). In the special case where a = id^ the ring 
^4[z;a] is the classical commutative polynomial ring and we know from Theorem 15.21 that no 
cr-cyclic convolutional codes with nonzero complexity exist. 

Example 5.4 Let us consider the case where F = F2 and n = 7. Thus A = ¥[x]/ ,7 

Using 1)5.5(1 one obtains 18 automorphisms, also listed at |191 p. 680, Table II] (containing 
one typo: the last element of that table has to be x 2 + x s + x 4 + x 5 + x G rather than 
x + x 3 + x A + x 5 + x 6 ). 

Let us choose the automorphism a € Aa.t$(A) defined by a(x) = x 5 . Furthermore, we 
consider the polynomial 

g := 1 + x 2 + x 3 + x 4 + z(x + x 2 + x 3 + x 5 ) G A[z; a] 

and denote by '(g) := {fg \ f G A[z;(t]} the left ideal generated by g in ^[zjct]. Moreover, 
put S := p^Cig)) C ¥[z] 7 . We will show now that S is a direct summand of F[z] 7 , hence 
S = C n ¥[z] 7 for some convolutional code C C F((z)) 7 , see Remark 15.11 In order to do so we 
first notice that 

'(g) = span F[z] {sr, xg, ... ,x 6 g} 

and therefore 

S = {uM I u G ¥[z} 7 } where M = 



p-Hg) 

P 1 (xg) 
P^(x & g) 
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Thus we have to compute x l g for i = 1, . . . , 6. Using the multiplication rule in ()5.6|) we obtain 

xg = x + x 3 + x 4 + x 5 + z(l + x + x 3 + x 6 ), 
x 2 g = x 2 + x 4 + x 5 + x 6 + .z(x + x 4 + x 5 + x 6 ), 
x 3 # = 1 + x 3 + x 5 + x 6 + z(x 2 + x 3 + x 4 + x 6 ) 
= 9 + x 2 g. 

Since x 3 g is in the F-span of the previous elements, we obtain '(g) = spanipr 2 ]{g, xg, x 2 g} 
and, since p is an isomorphism, 

S = {uG\u EF[z] 3 }, 

where 









"1 


z 


1+z 


1 + z 1 


z 


G = 


p-^xg) 




z 


1 + z 





1 + z 1 


1 z 




_p- 1 (x 2 g)_ 




_0 


z 


1 


1 + z 


1+z 1+z 



One can easily check that the matrix G is right invertible. Hence S is indeed a direct summand 
of ¥[z] 7 and thus we have obtained a cr-cyclic convolutional code C = imG C F((z)) 7 . This is 
exactly the (7, 3, 3; l)2-code given in Table I of the last section. 

The other cyclic convolutional codes in Tables I and II are obtained in a similar way. Since 
the underlying automorphism cannot easily be read off from the generator matrix of a cyclic 
convolutional code we will, for sake of completeness, present them explicitly in the following 
table. All those codes come from principal left ideals in A[z; a] and, except for the codes with 
parameters (3, 2, 3;2)ie, (5, 2,2;l)ig, (7, 2, 3;2)g, the generator polynomial can be recovered 
from the given data by applying the map p to the first row of the respective generator matrix. 
The generator matrices of the remaining three codes are built in a slightly different way. In 
those cases each row of the given matrix generates a 1-dimensional cyclic code and thus each 
of those three codes is the direct sum of two 1-dimensional cyclic codes. In each case a 
generator polynomial of the associated principal left ideal is obtained by applying p to the 
sum of the two rows of the respective generator matrix. 



Table IV 



(n, k, 5; m) q -code of Tables I and II 


automorphism given by 


(7, 3, 3m; rn)2, m = 1, . . . , 4 


cr(x) — x 


(15,4,4; l) a 


a(x) = x + x 7 + x 10 


(15, 4, 4m; m)2, m = 2, 3 


a(x) =x 3 + x s + x 7 + x 10 + x 12 + x 13 + x 1 * 


(3, 1,5; 5) 4 , 5=1,..., 5 


<j(x) = a 2 x 


(5, 2, 2m; m)4, m = 1, 2, 3 


a{x) — x 2 


(3,2,2;1) 16 and (3,2,3;2) 16 


&(x) = 7 10 x 


(5, 1, 5; 5)i 6 , 5 = 1, 2, 3 and (5, 2, 2; 1) 16 


a(x) = x 3 


(7, 1,5; 5) 8 , 5 = 1,2 and (7,2,3;2) s 


cix) — x 5 


(7,1,3;3) 8 


a(x) =[3x + fix 2 + [3 3 x 3 + (3 3 x A + (3 3 x 5 + (3 2 x 6 



The fact that all the cyclic convolutional codes above come from principal left ideals in A[z; a] 
is not a restriction since we have the following important result. 
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Theorem 5.5 Let a G Autf(yl). If S C F[z] n is a a-cyclic direct summand, then p(S) is 
a principal left ideal of A[z\o\, that is, there exists some polynomial g G A[z;cj] such that 
p(S) = '(g). We call g a generator polynomial of both S and the a-cyclic convolutional code 
C C F((z)) n determined by S, see Remark \5.1\ and (|2.2jl . 



The generator polynomial of a a-cyclic convolutional code can be translated into vector 
notation and leads to a generalized circulant matrix. This looks as follows. Let S C ¥[z] n be 
a cr-cyclic direct summand and let p(«S) = '(g). Define 



p-Hg) 
P l {xg) 

p- l (x n ~ l g) 



€¥[z] 



Then it is easy to see that p(uM a (g)) = p{u)g for all u G ¥[z] n (see Prop. 6.8(b)]) and 
therefore, S = {uA4 a (g) | u 6 F[z] n |. We call A4 a (g) the cr-circulant associated with g. 

Remark 5.6 Using the identities above we can now easily see that cr-cyclic structure can 
also be considered without restricting to the polynomial part. Just like the polynomial ring 
A[z] we can turn the set A((z)) of formal Laurent series over A into a non-commutative ring by 
defining addition as usual and multiplication via Ij5.6|) . We will denote the ring obtained this 
way by A((z;a)). Furthermore, we can extend the map p to Laurent series in the canonical 
way, see also ([5.4)1 . Then one can easily show that just like in the polynomial case 

p(uM a (g)) =p{u)g for all u G ¥{(z)) n 

for each g G -Afzjo - ]. Using the fact that a code C C ¥((z)) n is uniquely determined by its 
polynomial part (see (|2.2jl ). and that the latter is a principal left ideal in A[z;cr] due to 
Theorem 15.51 one can now derive the equivalence 



CC¥( 



is cr-cyclic 



p(C) is a left ideal in A((z;a) 



Moreover, if C is cr-cyclic, a generator polynomial of the ideal p(C n F[z] n ) in A[z; a] is also a 
principal generator of the ideal p(C) in A((z; a)) . This justifies to call g a generator polynomial 
of the full code C as we did in Theorem 15.51 



At this point the question arises as to how a (right invertible) generator matrix can be 
obtained from the cr-circulant A4 a (g). Notice that in Example 15.41 the generator matrix of 
the code is simply given by the first three rows of the circulant. This is indeed in general 
the case, but requires a careful choice of the generator polynomial g of the code. Recall 
that, due to zero divisors in A[z;a], the generators of a principal left ideal, are highly non 
unique. The careful choice of the generator polynomial is based on a Grobner basis theory 
that can be established in the non-commutative polynomial ring ^4[z;cr]. This is a type of 
reduction procedure resulting in unique generating sets of left ideals which in turn produce 
very powerful cr-circulants. The details of this theory goes beyond the scope of this paper and 
we refer the reader to [H] for the details, in particular to jSJ Thm. 7.8, Thm. 7.18]. Therein 
it has been shown that a reduced generator polynomial also reflects the parameters of the 
code, i. e., the dimension and the complexity, and even leads to a minimal generator matrix 
through cr-circulants. Only with these results it becomes clear that cyclic convolutional codes 
can have only very specific parameters (length, dimension, and complexity) depending on the 
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chosen field ¥ q . Furthermore, the notions of parity check polynomial and associated parity 
check matrix have been discussed in detail in [5] , leading to a generalization of the block code 
situation. 

As for the cyclic codes of the last section we only would like to mention that their generator 
polynomials obtained as explained right before Table IV are all reduced in the sense above. 

So far we do not have any estimates for the distance of a cyclic convolutional code in terms of 
its (reduced) generator polynomial and the chosen automorphism. The examples given in the 
last section have been found simply by trying some promising reduced generator polynomials 
(using the algebraic theory of 5 ). Except for the puncturing in Table III we did not perform 
a systematic search for optimal codes. 

Conclusion 

In this paper we gave many examples of cyclic convolutional codes that all reach the Griesmer 
bound. The examples indicate that this class of convolutional codes promises to contain 
many excellent codes and therefore deserves further investigation. As one of the next steps 
the relation between the (reduced) generator polynomial and the automorphism on the one 
hand and the distance on the other hand should be investigated in detail. 
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